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Abstract. Weighted automata are nondeterministic automata with nu- 
merical weights on transitions. They can define quantitative languages L 
that assign to each word w a real number L(w). In the case of infinite 
words, the value of a run is naturally computed as the maximum, lim- 
sup, liminf, limit average, or discounted sum of the transition weights. 
We study expressiveness and closure questions about these quantitative 
languages. 

We first show that the set of words with value greater than a threshold 
can be non-cj-regular for deterministic limit-average and discounted-sum 
automata, while this set is always ^-regular when the threshold is isolated 
(i.e., some neighborhood around the threshold contains no word). In the 
latter case, we prove that the w-regular language is robust against small 
perturbations of the transition weights. 

We next consider automata with transition weights or 1 and show that 
they are as expressive as general weighted automata in the limit-average 
case, but not in the discounted-sum case. 

Third, for quantitative languages L\ and L2, we consider the operations 
max(ii, L2), min(Li, L2), and 1 — L\, which generalize the boolean oper- 
ations on languages, as well as the sum L\ + L2 . We establish the closure 
properties of all classes of quantitative languages with respect to these 
four operations. 



1 Introduction 

A boolean language L can be viewed as a function that assigns to each word 
w a boolean value, namely, L(w) = 1 if the word w belongs to the language, 
and L(w) = otherwise. Boolean languages model the computations of reactive 
programs. The verification problem "does the program A satisfy the specifica- 
tion BT' then reduces to the language- inclusion problem "is La Q Lb?", or 
equivalcntly, "is La(w) < Lb{w) for all words 10?", where La represents all 
behaviors of the program, and Lb contains all behaviors allowed by the spec- 
ification. When boolean languages are defined by finite automata, this elegant 
framework is called the automata-theoretic approach to model-checking [VW86]. 

In a natural generalization of this framework, a cost function assigns to each 
word a real number instead of a boolean value. For instance, the value of a word 
(or behavior) can be interpreted as the amount of some resource (e.g., memory 



consumption, or power consumption) that the program needs to produce it, 
and a specification may assign a maximal amount of available resource to each 
behavior, or bound the long-run average available use of the resource. 

Weighted automata over semirings (i.e., finite automata with transition 
weights in a semiring structure) have been used to define cost functions, 
called formal power series for finite words [Sch61,KS86] and w-series for infinite 
words [CK94,DK03,EK04]. In [CDH08], we study new classes of cost functions 
using operations over rational numbers that do not form a semiring. We call 
them quantitative languages. We set the value of a (finite or infinite) word w 
as the maximal value of all runs over w (if the automaton is nondeterministic, 
then there may be many runs over w), and the value of a run r is a function 
of the (finite or infinite) sequence of weights that appear along r. We consider 
several functions, such as Max and Sum of weights for finite runs, and Sup, 
LimSup, Limlnf, limit average, and discounted sum of weights for infinite runs. 
For example, peak power consumption can be modeled as the maximum of a 
sequence of weights representing power usage; energy use can be modeled as the 
sum; average response time as the limit average [CCH + 05,CdAHS03]. Quanti- 
tative languages can also be used to specify and verify reliability requirements: 
if a special symbol _L is used to denote failure and has weight 1, while the other 
symbols have weight 0, one can use a limit-average automaton to specify a bound 
on the rate of failure in the long run [CGH+08]. The discounted sum can be used 
to specify that failures happening later are less important than those happening 
soon [dAHM03] . 

The quantitative language-inclusion problem "Given two automata A and £?, 
is La(w) < Lb(w) for all words w?" can then be used to check, say, if for each 
behavior, the peak power used by the system lies below the bound given by the 
specification; or if for each behavior, the long-run average response time of the 
system lies below the specified average response requirement. s In [CDH08], we 
showed that the quantitative language-inclusion problem is PSPACE-complctc 
for Sup-, LimSup-, and Limlnf-automata, while the decidability is unknown for 
(nondeterministic) limit-average and discounted-sum automata. We also com- 
pared the expressive power of the different classes of quantitative languages and 
showed that nondeterministic automata arc strictly more expressive in the case 
of limit-average and discounted-sum. 

In this paper, we investigate alternative ways of comparing the expressive 
power of weighted automata. First, we consider the cut-point languages of 
weighted automata, a notion borrowed from the theory of probabilistic au- 
tomata [Rab63]. Given a threshold 77 e R, the cut-point language of a quan- 
titative language L is the set of all words w with value L(w) > r], thus a 
boolean language. We show that deterministic limit-average and discounted-sum 
automata can define cut-point languages that are not w-regular. Note that there 
exist aj-rcgular languages that cannot be expressed as a cut-point language of 
a limit-average or discounted-sum automaton [CDH08]. Then, wc consider the 
special case where the threshold 77 is isolated, meaning that there is no word with 
value in the neighborhood of 77. We argue that isolated cut-point languages have 
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stability properties, by showing that they remain unchanged under small pertur- 
bations of the transition weights. Furthermore, we show that every discounted- 
sum automaton with isolated cut-point defines an w-regular language, and the 
same holds for deterministic limit-average automata. This question is open for 
nondeterministic limit-average automata. Finally, we consider a boolean coun- 
terpart of limit-average and discounted-sum automata in which all transitions 
have weight or 1. Of special interest is a proof that limit-average automata 
with rational weights in the interval [0, 1] can be reduced to automata with 
boolean weights. Therefore, the restriction to boolean weights does not change 
the class of quantitative languages definable by limit-average automata; on the 
other hand, we show that it reduces that of discounted-sum automata. 

In the second part of this paper, we study the closure properties of quanti- 
tative languages. It is natural and convenient to decompose a specification or a 
design into several components, and to apply composition operations to obtain a 
complete specification. We consider a natural generalization of the classical op- 
erations of union, intersection and complement of boolean languages. We define 
the maximum, minimum, and sum of two quantitative languages L\ and L2 as 
the quantitative language that assigns max(Li(iz;), L2(w)), min(Z/i(w), Li2(w)), 
and L\(w) + L2(w) to each word w. The complement L c of a quantitative lan- 
guage L is defined by L c (w) = 1 — L(w) for all words w. 3 The sum is a natural 
way of composing two automata if the weights represent costs (e.g., energy con- 
sumption). We give other examples in Section 2 to illustrate the composition 
operations and the use of quantitative languages as a specification framework. 

We give a complete picture of the closure properties of the various classes 
of quantitative languages (over finite and infinite words) under maximum, mini- 
mum, complement and sum (see Table 1). For instance, limit- average automata 
are not closed under sum and complement, while nondeterministic discounted- 
sum automata are closed under sum but not under complement. All other classes 
of weighted automata are closed under sum. For infinite words, the closure prop- 
erties of Sup-, LimSup-, and Limlnf-automata are obtained as a direct extension of 
the results for the boolean finite automata, while for LimAvg- and Disc-automata, 
the proofs respectively require the analysis of the structure of the automata 
cycles and properties of the solutions of polynomials with rational coefficients. 
Note that the quantitative language-inclusion problem "is La(w) < Lb(w) for all 
words w?" reduces to closure under sum and complement because it is equivalent 
to the question of the non-existence of a word w such that La[w) + L c B (w) > 1, 
that is an emptiness question which is dccidablc for all classes of quantitative lan- 
guages [CDH08] . Also note that deterministic limit-average and discounted-sum 
automata are not closed under maximum, which implies that nondeterministic 
automata are strictly more expressive in these cases (because the maximum can 
be obtained by an initial nondeterministic choice). 

Related work. Functions such as limit average (or mean payoff) and discounted 
sum have been studied extensively in the branching-time context of game the- 

3 One can define L c (w) — k — L(w) for any constant k without changing the results 
of this paper. 
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ory [Sha53,EM79,Con92,ZP96,CdAHS03]. It is therefore natural to use the same 
functions in the linear-time context of languages and automata. 

Weighted automata with discounted sum have been considered in [DR07], 
with multiple discount factors and a boolean acceptance condition (Muller or 
Buchi); they are shown to be equivalent to a weighted monadic second-order 
logic with discounting. Several other works have considered quantitative gener- 
alizations of languages, over finite words [DG07], over trees [DKR08], or using 
finite lattices [GC03] , but none of these works has addressed the expressiveness 
questions and closure properties for quantitative languages that are studied here. 

The lattice automata of [KL07] map finite words to values from a finite lat- 
tice. The lattice automata with Buchi condition are analogous to our LimSup 
automata, and their closure properties are established there. However, the other 
classes of quantitative automata (Sum, limit-average, discounted-sum) are not 
studied there as they cannot be defined using lattice operations and finite lat- 
tices. 

2 Quantitative Languages 

A quantitative language L over a finite alphabet £ is either a mapping L : S + — > 
R or a mapping L : — > R, where R is the set of real numbers. 

Weighted automata. A weighted automaton is a tuple A = (Q, qi, S, 6, 7) 

where: 

— Q is a finite set of states, qi £ Q is the initial state, and S is a finite alphabet; 
-iCQx£xQisa finite set of labelled transitions. We assume that 6 is 

total, that is for all q £ Q and a € S, there exists (q, cr,q') £ 6 for at least 
one q' £ Q; 

— 7 : S — > Q is a weight function, where Q is the set of rational numbers. We 
assume that rational numbers are encoded as pairs of integers in binary. 

We say that A is deterministic if for all q € Q and a £ S, there exists 
(q,o~,q') £ 5 for exactly one q' £ Q. We sometimes call automata nondetermin- 
istic to emphasize that they are not necessarily deterministic. 

A run of A over a finite (resp. infinite) word w — <J\<J2 ■ ■ ■ is a finite (rcsp. 
infinite) sequence r = qa<J\qio~2 ■ ■ ■ of states and letters such that (i) q$ = qi, 
and (ii) (qi, qi+i) £ S for all < i < \w\. We denote by 7(7*) = vqvi . . . the 
sequence of weights that occur in r where Vi = *y(qi, <7j+i, qi+i) for all < i < \w\. 

Given a value function Val : Q + — > R (rcsp. Val : Q w — > R), we say that the 
Val-automaton A defines the quantitative language La such that for all w £ S + 
(resp. w £ S^): 

La{w) = sup{Val(7(r)) r is a run of A over to}. 

We consider the following value functions to define quantitative languages. 
Given a finite sequence v = v\ . . . v n of rational numbers, define 
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— Last(u) = v n ; 

— Max(u) = sup{w.j | 1 < i < n}; 

n 

— Sum(u) = y^^; 

i=i 

Given an infinite sequence v = vqVi ... of rational numbers, define 

— Sup(w) = sup{w n | n > 0}; 

— LimSup(u) = limsup v„ = lim sup{u,; | i > n}; 

n — >oo n '°° 

— Liminf (v) = liminf v n = lim inf{«j | i > n}; 

n — >oo n — >oc 

^ n— 1 

— LimAvg(u) = liminf — vf, 

n — ^oo fi — J 

i=0 

oo 

— For < A < 1, Disc A (u) = ^ A 4 • vn 

i=0 

Note that Biichi and coBiichi automata are special cases of respectively 
LimSup- and Limlnf-automata, where all weights are either or 1. 

Notations. Classes of weighted automata over infinite words are denoted with 
acronyms of the form xy where x is either N(ondeterministic), D(eterministic), 
or °yW (when deterministic automata have the same expressiveness as non- 
deterministic automata), and y is one of the following: Sup, Lsup (LimSup), 
LiNF(Limlnf), LAVG(LimAvg), or Disc. For Biichi and coBiichi automata, we 
use the classical acronyms NBW, DBW, NCW, etc. 

Reducibility. A class C of weighted automata is reducible to a class C of 
weighted automata if for every A e C there exists A' <G C such that La = La>, 
i.e. La(w) = La'(w) for all (finite or infinite) words w. In particular, a class of 
weighted automata can be determinized if it is reducible to its deterministic coun- 
terpart. Reducibility relationships for (non)dcterministic weighted automata are 
given in [CDH08]. 

Composition. Given two quantitative languages L and V over S, and a ratio- 
nal number c, we denote by max(L, L') (resp. min(L, L'), L+L' , c+L, and cL) the 
quantitative language that assigns max{L(ui), L'(w)} (resp. min{L(w), L'(w)}, 
L(w) +L'(w), c + L(w), and c- L(w)) to each word w G S + (or w £ S 1 ^). We say 
that c + L is the shift by c of L and that cL is the scale by c of L. The language 
f — L is called the complement of L. The max, min and complement operators for 
quantitative languages generalize respectively the union, intersection and com- 
plement operator for boolean languages. For instance, De Morgan's laws hold 
(the complement of the max of two languages is the min of their complement, 
etc.) and complementing twice leave languages unchanged. 

Example 1. We consider a simple illustration of the use of limit-average automata 
to model the power consumption of a motor. The automaton B in Fig. 1(b) spec- 
ifies the maximal power consumption to maintain the motor on or off, and the 
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off on 




slow 

(a) Limit-average automaton A. 



off on 




off, slow 



(b) Limit-average automaton B. 
Fig. 1. Specifications for the power consumption of a motor. A refines B, i.e. La < Lb- 

maximal consumption for a mode change. The specification abstracts away that 
a mode change can occur smoothly with the slow command. A refined specifica- 
tion A is given in Fig. 1(a) where the effect of slowing down is captured by a third 
state. One can check that La(w) < Lb(w) for all words w e {on, off,slow} u . 
Given two limit-average automata that model the power consumption of two dif- 
ferent motors, the maximal, minimal, and the sum of average power consumption 
are obtained by composing the automata under max, min and sum operations, 
respectively. 

Example 2. Consider an investment of 100 dollars that can be made in two 
banks A\ and A2 as follows: (a) 100 dollars to bank A\, (b) 100 dollars to bank 
A2, or (c) 50 dollars to bank A\ and 50 dollars to bank A^. The banks can be 
cither in a good state (denoted G\, G2) or in a bad state (denoted B\, B^)- If it 
is in a good state, then A\ offers 8% reward while A2 offers 6% reward. If it is 
in a bad state, then A\ offers 2% reward while A2 offers 4% reward. The change 
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(5i,0, 8 



(bi,-),2 




(a) 100 dollars invested in bank Ai. 
(;g 2 ),6 

(■,62), 




(■,92), 6 

(b) 100 dollars invested in bank A2. 



Fig. 2. The discounted-sum automata models of two banks. 



of state is triggered by the input symbols b\ , 62 (from a good to a bad state) 
and g\ , 172 (from a bad to a good state) . The rewards received earlier weight more 
than rewards received later due to inflation represented by the discount factor. 
The automata A\ and A2 in Figure 2 specify the behavior of the two banks 
for an investment of 100 dollars, where the input alphabet is {<7i,6i} x {(72,^2} 
(where the notation (<7i,-) represents the two letters (51,52) and (gi, 62), and 
similarly for the other symbols). If 50 dollars are invested in each bank, then 
we obtain automata C\ and Ci from A\ and Ai where each reward is halved. 
The combined automaton is obtained as the composition of C\ and C2 under 
the sum operation. 



3 Expressiveness Results for Weighted Automata 

The expressive power of weighted automata can be compared by mean of the 
reducibility relation, saying that a class C of weighted automata is at least as ex- 
pressive as a class C if every quantitative language definable by some automaton 
in C is also definable by some automaton in C . The comparison includes boolean 
languages, considering them as a special case of quantitative languages of the 
form L : E w — > {0, 1}. It was shown in [CDH08] that a wide variety of classes 
of quantitative languages can be defined by the different types of weighted au- 
tomata, depending on the value function and whether they are deterministic or 
not. This contrasts with the situation for boolean languages where most of the 
classes of automata define oj-rcgular languages. In this section, we investigate 
alternative ways of comparing the expressive power of weighted automata and 
of classical finite automata. First, we use the cut-point languages of weighted 
automata to compare with the class of w-regular languages, and then we use 
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weighted automata with boolean weights, i.e. all transitions have weight or 1, 
to compare with general weighted automata. 

3.1 Cut-point languages 

Let L be a quantitative language over infinite words and let 77 G M be a threshold. 
The cut-point language defined by (L, rj) is the (boolean) language 

L- r ' = {w e A7 W I L(w) > rj}. 

Cut-point languages for finite words are defined analogously. They have been 
first defined for probabilistic automata [Rab63] , then generalized to inverse image 
recognition for semiring automata over finite words [CMOO] . It is easy to see that 
the cut-point languages of Max- and Last-automata are regular, those of Sum- 
automata are context-free, and those of Sup-, LimSup-, and Limlnf-automata are 
w-regular. 

We show that the classes of cut-point languages definable by 
(non)detcrministic limit-average and discounted-sum automata are incom- 
parable with the w-regular languages. The result follows from Theorem 1, and 
from [CDH08, Theorems 13 and 14]. 

Theorem 1. There exists deterministic limit-average and discounted-sum au- 
tomata whose cut-point language is not to-regular. 

Proof. Consider the alphabet U = {a, b}, and consider the languages L\ that 
assigns to each word its long-run average number of a's, and Li that assigns the 
discounted sum of a's. Note that L\ is definable by a deterministic limit-average 
automaton, and Li by a deterministic discounted-sum automaton. It was shown 
in [Cha07] that the cut-point language if 1 is complete for the third level of 
the Borel hierarchy, and therefore is not w-regular. We show that L^ 1 is not 
w-regular. 

Given a finite word w € £*, let r a (w) = J2i\w i= a ^ be the discounted sum 

of a's in w. We say that w is ambiguous if 1 — < r a (w) < 1. The ambiguity 

lies in that some continuations of w (namely w.a^) are in L^ 1 and some are 
not (namely w.b^). We show that for all A > i, if w is ambiguous, then either 
w.a or w.b is ambiguous, which entails that there exists an infinite word w- all 
of whose finite prefixes are ambiguous (and L 2 {w^) = 1). To do this, assume 

that 1 — < r a (w) < 1, and show that either 1 — A 1 ^' A < r a (w.a) < 1 or 

1 — A 1 ^' A < r a (w.b) < 1. Since r a (w.a) = r a (w) + A' 1 "' and r a (w.b) = r a (w), we 

have to show that 1 - ^ < r a (w) < 1 - AH or 1 - < r a (w) < 1. This 

holds if 1 — A 1 ^' A ' < 1 — A' 1 "', which is equivalent to A > |. 

Now, we show that if there exists a nondeterministic Biichi automaton A for 
L^ 1 , then the set of states S n reached in A by reading the first n letters of w- 
( which we denote by uir^ ,) should be different for each n, i.e. n ^ m implies 



8 



S n 7^ S m . Towards a contradiction, assume that S n = S m for n < m. Then for all 
w' G S u , we have n y w> S L^ 1 if and only if wr ± m y w ' G I* 1 particular, 
for w' = wp i, this shows that LiiwP i.u>') = 1 = L-riwr, ,.u/) since 

[m+l...J" zv [l...nj ' ^ [l...mj / 

L 2 {w-) = 1 and r a (w[ 1 ... n ]) < r a (w[x„. m ]). This yields 

r a (w[i...n]) + X" ■ r a (w') = 1 = r a (w[i,„ m ]) + A™ ■ r a (w') 

that is, by eliminating r a (w'), A™-™(1 - P(A)) = 1 - Q(A) where P(A)) = 
r a{w[i...n}) an d Q(A) = f a {wu___ m A arc polynomials of respective degree n — 1 
and m — 1, and with coefficients in the set {0, 1}. First, observe that the equation 
is not identically because the coefficient of the term of degree is not (as the 
first letter of w- must be b since a is not ambiguous). Second, every coefficient 
in the equation is in the set { — 1,0,1,2}, and a classical result shows that if 
- is a solution of a polynomial equation with p and q mutually prime, then p 
divides the coefficient of degree 0, and q divides the coefficient of highest degree. 
Therefore, no rational number in the interval 1[ can be a solution. This shows 
that n m implies S„ S m , and the automaton A cannot have finitely many 
states. ■ 

We note that cut-point languages are not stable under arbitrarily small per- 
turbations of the transition weights, nor of the value of the cut-point. Consider 
the quantitative languages L\, L2 from the proof of Theorem 1. If for instance 
a limit-average automaton A assigns weight 1 + e to the a's and to the 6's, 
its cut-point language L^ 1 is clearly not different from Lf 1 that assigns to each 
word its long-run average number of a's, no matter the value of e > 0. The same 
holds with respect to L2 if A is interpreted as a discounted-sum automaton. 

In the theory of probabilistic automata, where finite words are assigned a 
probability of acceptance, the cut-point languages may also be non-regular. 
Therefore, one considers the special case where the cut-point is isolated, and 
shows that the cut-point languages are then regular [Rab63] . 

A number 77 is an isolated cut-point of a quantitative language L if there 
exists e > such that 

\L(w) -r)\>e for all w e S u . 

We show that every discounted-sum automaton with isolated cut-point de- 
fines an w-regular language, and that this also holds for deterministic limit- 
average automata. We also argue that this notion has stability properties, in 
that isolated cut-point languages remain unchanged under small perturbations 
of the transition weights. This follows from a more general result about the 
robustness of weighted automata. 

A class of weighted automata is robust if a small (syntactical) perturbation 
in the weights of an automaton induces only a small (semantical) perturbation 
in the values of the words in the quantitative language of the automaton, and 
the semantical perturbation tends to when the syntactical perturbation tends 
to 0. To formally define robustness, we need e-approximations of automata, and 
distance between quantitative languages. 



9 



Let A = (Q,qi, S,S,"f) be a (nondeterministic) weighted automaton, and 
let e G R-°. We say that a weighted automaton B = (Q 1 , q'j, S,6' ,7') is an 
e- approximation of A if 

— Q' = Q, q'j = qi, 5' = 5, and 

- \i'{q,°,q') -7(?,o-,9')l < 6 for a11 (<7) cr :9 / ) £ 

The swp-distance between two quantitative languages L-y, L2 : -S 1 " — > R is defined 
by 

D sup (L 1 ,L 2 ) = sup \Li(w) - L 2 {w)\. 
we£<* 

We say that a class C of weighted automata is uniformly robust if for all 
77 G M >0 , there exists e G M >0 such that for all automata A,BeC where 
B is an e-approximation of A, we have D sup (La, Lb) < T). Note that uniform 
robustness implies a weaker notion of robustness where a class C of weighted 
automata is called robust if for all automata A G C and for all 77 G IR >0 , there 
exists e G M >0 such that for all e-approximation B of A (with B G C), we have 
D sup (L A ,L B ) < r). 

Theorem 2. T/ie classes of (non) deterministic Sup-, LimSup-, Limlnf-, LimAvg- 
tme? Disc-aitfomata are uniformly robust. 

Proof. Let A, B be two weighted automata with B an e-approximation of A. It 
is easy to see that for Sup-, LimSup-, Limlnf- and LimAvg-automata, the value of 
a run r of B differs by at most e from the value of the same run in A. Therefore, 
D sup (La, Lb) < e and we can take e = 77. For Disc-automata, the value of a run 
of B differs by at most from the value of the same run in A, where A is the 
discount factor. Therefore, we can take e = 77(1 — A). ■ 

As a corollary of Theorem 2, for an isolated cut-point 77, the cut-point 
language L- v remains unchanged under small perturbations of the transition 
weights. 

Theorem 3. Let La be the quantitative language defined by a weighted automa- 
ton A, and let r\ be an isolated cut-point of La ■ There exists e > such that for 
all e- approximations B of A, L^ 1 = L^ 1 (where Lb is the quantitative language 
defined by B). 

Now, we show that the isolated cut-point languages of deterministic 
discountcd-sum and limit-average automata are w-regular. For nondeterministic 
automata, the same property holds in the discounted-sum case, but the question 
is open for limit average. 

Theorem 4. Let L be the quantitative language defined by a Disc- automaton. 
If 77 is an isolated cut-point of L, then the cut-point language L- v is to-regular. 
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Proof. Let A be the discount factor of the Disc-automaton that defines L. 
Since, 77 is an isolated cut-point of L, let e > such that \L(w) — 77 > e for all 
w G S u . Let n € N such that u„ = \'\ < e where = max( ?i(rg ') |<S(g, cr, 
is largest weight in A. Consider any run r in A of length n, and let 7(f) be the 
A-discounted sum of the weights along r. Then, it should be clear that 7(7") £ 
[77 — e + u n , 77 + e — u n ] , because otherwise, the value of any (infinite) continuation 
of r would lie in the interval [77 — 6,77 + 6], which would be a contradiction. 
Moreover, if 7(1*) < 77 — e + u n , then any (infinite) continuation of r has value 
less than 77, while if 7(7") > 77 + e — u„, then any (infinite) continuation of r has 
value greater than 77. Therefore, the cut-point language L- v can be defined by 
the unfolding up to length n of the Disc- automaton that defines L, in which the 
states that are reached via a path with value at least 77 + e — u n arc declared to 
be accepting, and have a self-loop on S. ■ 



Theorem 5. Let L be the quantitative language defined by a deterministic 
LimAvg- automaton. If 77 is an isolated cut-point of L, then the cut-point lan- 
guage L- v is to-regular. 

Proof. Let A be a deterministic LimAvg-automaton, defining the language L. 
Consider the SCC-dccomposition C\, C%, . . . , Ck of the underlying graph of A. 
For each 1 < i < k, let rrn and Mj be the minimal and maximal average weight 
of a cycle in Ci (those values can be computed with Karp's algorithm [Kar78]). 
It is easy to see that for every 1 < i < k, for every v £ [mi, Mi], there exists 
a word w £ S w such that L(w) = v. Therefore, since 77 is an isolated cut-point 
of L, we have 77 g" [mj,Mj] for all 1 < i < k. A DBW for L- v is obtained from 
A by declaring to be accepting all states q of A such that q 6 Ci and m,; > 77. 



3.2 Boolean weights 

We consider weighted automata with boolean set of weights, i.e. all transitions 
have weight or 1. The aim is to have a boolean counterpart to limit-average 
and discounted-sum automata, and check if this changes their expressive power. 
We show that the restriction does not change the class of quantitative languages 
definable by limit-average automata, but does reduce that of discounted-sum 
automata. 

Given a set i?CR, and a class C of nondetcrministic weighted automata, we 
denote by Cr the class of all automata in C whose weights are rational numbers 
in R. 

Theorem 6. The class of nondeterministic (resp. deterministic) LimAvg- 
automata with weights in [0, 1] HQ is reducible to the class of nondeterministic 
(resp. deterministic) LimAvg- automata with weights and 1 only. 
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Proof. Given a NLAVG [0,1] -automaton A = (Q,qi, S,S,j), we construct a 
NLAVG{ ,i}-automaton B such that La = Lb- 

First, let W = {7(9, cr, q') \ (q, a, q') G 6} be the set of weights that occur 
in A, and let ua be the smallest integer n such that for all v £ W, there exists 
e 6 N such that v = — (i.e., — is the greatest common divisor of the weights of 
A). We define B = (Q\ q' I: S, V, 7'} as follows: 

— Q' = Q x [ha] (where [ua] denotes the set {0, 1, ...,ua — 1})- Intuitively, 
when we reach a state (q, i) in B, it means that the state q was reachable 
in A and that the sum of the weights to reach q is of the form k H — — for 
some integer k. In B however, the sum of the weights to reach (q, i) will then 
be k, and we store in the discrete state the information that the remainder 
weight is Whenever this remainder exceeds 1, we introduce a weight 1 
and decrement the remainder. 

— ?/ = («i,0); 

— for each transition (q,<r,q') £ S and each value i S [ua], the following tran- 
sitions are in 5' (where v — 7(9, cr, q')): 

• ((q,i),a, (q',j)) for j = i + (v — 1) ■ Ua if ^- + v > 1; the weight of such 
a transition is 1 in 7', 

• ((c/,i),cr, (q',j)) for j = i + u • rc.yt if ^ — h w < 1; the weight of such a 
transition is in 7'. 

Note that in the above, v ■ ua is an integer and j G [ua]' 

There is a straightforward correspondence between the runs in A and the 
runs in B. Moreover, if the average weight of a prefix of length n of a run in 
A is — , then the average weight of the prefix of length n of the corresponding 
run in B is between — and ^±^-. Hence the difference tends to when n — > 00. 

n n 

Therefore, the value of a run in A is the same as the value of the corresponding 
run in B, and therefore La = Lb- 

Finally, note that if A is deterministic, then B is deterministic. ■ 

Theorem 7. The class of deterministic Disc- automata with rational weights in 
[0, 1] is not reducible to the class of (even nondeterministic) Disc- automata with 
weights and 1 only. 

Proof. Given a discount factor < A < 1, consider the NDisC[ 0i i] over E = 
{a, b} that consists of a single state with a self-loop over a with weight and 
a self-loop over b with weight 0. Let L\ be the quantitative language defined by 
this automaton. Towards a contradiction, assume that this language is defined 
by a NDiSC.ro, 1} A- First, consider the word ab^ whose value in L\ is < 1. 
This entails that A cannot have a transition from the initial state over a with 
weight 1 (as this would imply that L^(a6") > 1). Now, the maximal value that 
La can assign to the word a w is A + A 2 + A 3 + • • • = j^j which is strictly smaller 
than L\(a ul ) = 2 (i- X \) ■ This shows that A cannot exist. ■ 
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4 The Closure Properties of Weighted Automata 

We study the closure properties of weighted automata with respect to max, min, 
complement and sum. We say that a class C of weighted automata is closed under 
a binary operator op(-,-) (resp. a unary operator op'(-)) if for all A\,A2 G C, 
there exists A12 G C such that La 12 = °p(LAn La 2 ) (resp. La 12 = °p'(^Ai))- 
All closure properties that we present in this paper are constructive: when C is 
closed under an operator, we can always construct the automaton A12 G C given 
Ai,A2 G C. We say that the cost of the closure property of C under a binary 
operator op is at most 0(/(m, m\, ri2, 7712)) if for all automata A\, A2 G C with 
states and mi transitions (for i = 1,2 respectively), the constructed automaton 
A12 G C such that La 12 = op(La 1 , La 2 ) has at most 0(/(7ii, 771.1,77,2,7712)) many 
states. Analogously, the cost of the closure property of C under a unary operator 
op' is at most 0(f(n,m)) if for all automata A\ G C with n states and m 
transitions, the constructed automaton A12 G C such that La 12 = op^L^) 
has at most 0{f{n, m)) many states. For all reductions presented, the size of 
the largest weight in A12 is linear in the size p of the largest weight in A\,A2 
(however, the time needed to compute the weights is quadratic in p, as we need 
addition, multiplication, or comparison, which are quadratic operations over the 
rational numbers). 

Notice that every class of weighted automata is closed under shift by c and 
under scale by |c| for all c G Q. For Sum-automata and discounted-sum automata, 
we can define the shift by c by making a copy of the initial states and adding c 
to the weights of all its outgoing transitions. For the other automata, it suffices 
to add c to (resp. multiply by \c\) all weights of an automaton to obtain the 
automaton for the shift by c (resp. scale by \c\) of its language. Therefore, all 
closure properties also hold if the complement of a quantitative language L was 
defined as k — L for any constant k. 

Our purpose is the study of quantitative languages over infinite words. For 
the sake of completeness, we first give an overview of the closure properties for 
finite words. 

4.1 Closure properties for finite words 

We successively consider closure under max, min, complement, and sum for 
weighted automata over finite words. Table 1(a) summarizes the closure proper- 
ties of Max-, Last- and Sum-automata. 

Theorem 8. Deterministic Sup- and Last- automata are closed under max, with 
cost 0(ni -n2). Nondeterministic Sup-, Last- and Sum-automata are closed under 
max, with cost 0(ni + ?i 2 ). Deterministic Sum-automata are not closed under 
max. 

Proof. For the nondeterministic automata, the result follows from the fact 
that the max operator can be obtained by an initial nondeterministic choice 
between two quantitative automata. For deterministic Sup- and Last-automata, 
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the result follows from the fact that the classes of nondeterministic Sup- and Last- 
automata are reducible 4 to their respective deterministic counterpart. Finally, 
deterministic Sum-automata are not closed under the max operator because the 
language over S = {a, 6} that assigns to each finite word w £ S + the number 
max{L a (w), Lb(w)} where L a (w) is the number of occurrences of a in w (for 
a = a,b) is definable by the max of two deterministic-Sum languages, but not 
by a deterministic Sum-automaton (Theorem 4 in [CDH08]). ■ 



Theorem 9. Deterministic and nondeterministic Sup-automata are closed un- 
der min 7 with cost 0{n% ■ mi ■ n2 • m%). Deterministic and nondeterministic 
Last-automata are closed under min, with cost 0{n\ ■ n^). Deterministic and 
nondeterministic Sum-automata are not closed under min. 

Proof. Given two Last-automata A\ and A2 (over the same alphabet), we 
use the classical synchronized product A12 = A\ x A2, where the weight of 
a transition in A\2 is the minimum of the corresponding transition weights in 
A\ and A 2 . It is easy to see that La 12 = min(i J 4 1 , La 2 )- If A\ and A 2 are 
deterministic, then so is A\2. 

The construction for Sup-automata is the same as for Sup-automata over 
infinite words given in the proof of Theorem 14. 

Finally, for Sum-automata, consider the language L m over £ = {a, 6} that 
assigns to each finite word w G S + the value rnm.{L a (w) , Lb(w)} where L a (w) 
is the number of occurrences of a in w (for a = a, 6). We claim that L m is not 
definable by a nondeterministic Sum-automaton. Indeed, assume that the Sum- 
automaton A defines L m . First, every the sum of weights in every reachable 
cycle of A over o's must be at most 0. Otherwise, we can reach the cycle with 
a finite word w\ and obtain an arbitrarily large value for the word w\a l for i 
sufficiently large, while for such i the value of wia 1 is the number of 6's in wi 
which is independent of i. Analogously, the sum of weights in every reachable 
cycle of A over 6's must be at most 0. Now, let (3 = max e gi 7(e) be the maximal 
weight in A, and consider the word w = a n b n for n > 2/3 • |Q|. Every run of 
A over a n (or over 6") can be decomposed in possibly nested cycles (since A is 
nondeterministic) and a remaining non-cyclic path of length at most \Q\. Hence, 
the value of any run over w is at most 2/3 • \Q\. However, the value of w should 
be n, yielding a contradiction. ■ 



Theorem 10. Deterministic Last- and Sum-automata are closed under comple- 
ment, with costO(n). Nondeterministic Last-automata are closed under comple- 
ment, with cost 0(2 n ). Nondeterministic Sum automata, and both deterministic 
and nondeterministic Sup-automata are not closed under complement. 

4 We say that a class C of quantitative automata is reducible to a class C' of quantitative 
automata if for every A £ C there exists A' £ C'such that La = La' ■ 
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Proof. To define the complement of the language of a deterministic Sum (or 
Last-) automaton, it suffices to multiply all the weights by —1, and then shift 
the language by 1. For the class of nondeterministic Last-automata, the result 
follows from the fact that it is reducible to its deterministic counterpart. 

The negative result for Sup-automata follows from an analogous in the 
boolean case (consider the language L over {a, b} such that L(a l ) = for all 
i > 1, and L(w) — 1 for all words containing the letter b). Finally, according to 
the proof of Theorem 9, the language min(L a ,Lb) where L a {w) is the number 
of occurrences of a in w (for a = a,b) is not definable by a nondeterministic 
Sum-automaton. Since min(L a ,Lf,) = — max(— L a , — Lb) and (i) —L a and —Lb 
arc definable by Sum-automata, and (ii) nondeterministic Sum-automata are 
closed under max (Theorem 8), the language max(— L a , —Lb) is definable by a 
nondeterministic Sum-automaton, and the result follows. ■ 

Theorem 11. Every class of weighted automata over finite words are closed 
under sum. The cost is 0(n\ ■ TI2) for Last- and Sum -automata, and 0{n\ ■ mi ■ 
712 ■ TU2) for Sup -automata. 

Proof. It is easy to see that the synchronized product of two Last-automata 
(resp. Sum-automata) defines the sum of their languages if the weight of a joint 
transition is defined as the sum of the weights of the corresponding transitions 
in the two Last-automata (resp. Sum-automata). 

The construction for Sup-automata is the same as for Sup-automata over 
infinite words given in the proof of Theorem 24. ■ 
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Table 1. Closure properties. 



4.2 Closure under max for infinite words 

The maximum of two quantitative languages defined by nondeterministic au- 
tomata can be obtained by an initial nondeterministic choice between the two 
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automata. This observation was also made in [DR07] for discounted-sum au- 
tomata. For deterministic automata, a synchronized product can be used for 
Sup and LimSup, while for Limlnf we use the fact that NLlNF is determinizable 
with an exponential blow-up [CDH08]. 

Theorem 12. The nondeterministic Sup-, LimSup-, Limlnf-, LimAvg- and Disc- 
automata are closed under max, with cost 0(n 1 +n 2 ), the deterministic Sup- and 
LimSup-ait£oma£a with cost 0(ni ■ 712), the deterministic Limlnf -automata with 
cost 0((mi + m 2 ) • 2" 1+ " 2 ). 

Proof Sketch. For all the nondeterministic quantitative automata, the result 
follows from the fact that the max operator can be achieved with an initial 
nondeterministic choice between two quantitative automata. For DLinf, the 
result follows from the reducibility of NLinf to DLinf with an exponential 
blow-up [CDH08]. We now prove that DLsup and DSuPare closed under max 
with cost 0(ni ■ n 2 ). Given two DLsup (or DSup) A\ and A 2 over the same 
alphabet, we construct the usual synchronized product A 12 = A\ x A 2 , where 
the weight of a transition in A\ 2 is the maximum of the corresponding transition 
weights in A\ and A 2 . It is easy to see that La 12 = max(i J 4 1 , La 2 ) in both cases 



Theorem 13. The deterministic LimAvg- and D\sc-automata are not closed un- 
der max. 

Proof. The fact that DDiSC is not closed under max follows from the proof 
of Theorem 34 in [CDH08], where it is shown that the quantitative language 
max(ii, L 2 ) cannot be defined by a DDiSC, where L\ (resp. L 2 ) is the language 
defined by the DDiSC that assigns weight 1 (resp. 0) to a's and weight (resp. 
1) to 6's. 

We now show that DLavg is not closed under max. Consider the alphabet 
£ = {a, b} and the quantitative languages L a and Lb that assign the value of 
long-run average number of a's and 6's, respectively. There exists DLavg for L a 
and Lb- We show that L m = ma,x(L a , Lb) cannot be expressed by a DLAVG. By 
contradiction, assume that A is a DLavg with set of states Q that defines L m . 
Consider any reachable cycle C over a's in A. The sum of the weights of the 
cycle must be its length |C|, as if we consider the word w* = wc • (a' c ') w where 
wq is a finite word whose run reaches C, the value of w* in L m is 1. It follows 
that the sum of the weights of the cycle C must be C|. Hence, the sum of the 
weights of all the reachable cycles C over a's in A is \C\. 

Consider the infinite word w^, = (a^l • b 2 ^Y , and let Wj = (a'^' • 6 2 IQI) J . 
Since i m (u' 00 ) = |, the run of A over has value |. It follows that for all 
e > 0, there is an integer j e , such that for all j > j £ , we have 



\v)j\ 3 
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where "f(wj) is the sum of the weights of the run of A over uij. Consider a word 
Woe constructed as follows. We start with the empty word wq and the initial state 
go of A, and for all j > 0, we construct ({uj+i, qj+i) from (wj, qj) as follows: the 
state qj + i is the last state of the run of A from qj over a'^' • b 2 ^. This run has 
to contain a cycle Cj+i over a's. We set Wj+i = Wj ■ a' < ^' + ^ Ci+1 ' • b 2 '®'. Observe 
that for all j > 1, the run of A over in the segment between wj and Wj+i is 
identical to the run from qj to qj+i up to the repetition of the cycle Cj+i once 
more. The word Woo is the limit of this construction (uij is a prefix of Woo for 

all j > 0). Let aj = ELil^l- Since 1 < 1^1 < \Q\ we have i < a i < i ' IQI- 
Hence we have the following equality: — 2^j1±£Ll Hence for all e > 0, 

there exists j £ such that for all j > j £ wc have 

l(wj) > f ■ K| - £• KJ +Qj 

2 1 a, 

> - — e ■ 



3 3 \Wj | + a,- 

2 1 j 

> -7 ~£ ' 



3 3 j-(3|Q| + |Q|) 

" 3 12|Q] 

Hence we have La(^oo) > | + 12 |q| ■ Since 1 < |Cj| < |Q| for all i > 1, we have 
L m {woo) < I which is a contradiction. ■ 



4.3 Closure under min for infinite words 

The next theorems generalize the closure property under intersection of the 
boolean languages. The construction of the automaton for the min is a direct 
extension of the well-known constructions in the boolean case. 

Theorem 14. The (non) deterministic Sup-automata are closed under min, with 
cost 0(«i • n%i ■ n 2 • m 2 ), 

Proof. Let A 1 = (Q 1 , qj, S, S u 71) and A 2 = (Q 2 , qj, £, 82, 72) be two NSup. 
We construct a NSup A± 2 = (Q,qi, such that La 12 — nunl-Ly^, La 2 }- 

Let Vj = {71(e) I e G Si} be the set of weights that appear in Ai (for i = 1,2), 
and define: 

— Q = Qi x V\ x Q 2 x V 2 . Intuitively, we remember in a state (qi,Vi, q 2 , v 2 ) the 
largest weights v\, v 2 seen so far in the corresponding runs of Ai and A 2 ; 

— qi = (<?)> «min> ?? > u min) where w^ in is the minimal weight in V$ (for i = 1,2); 

— For each er G S, the set (5 contains all the triples 
((qi,v 1 ,q 2 ,v 2 ),a,(q[,v' 1 ,q' 2 ,v 2 )) such that v { G V;, (qi,a,q[) G and 
u< = max{vi, 7(Qi,o-,Qi)}: for * = M; 
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- 7 is defined by 7(((<?i, Vi, q 2 , v 2 ), a, (q[ , v[ , q^, v' 2 ))) = nrin{«i, v 2 } for each 
((qi,vi,q2,v 2 ),cr, (q[,v[, q' 2 ,v' 2 )) G 6. 

If Ai and A 2 are deterministic, then A 12 is deterministic. The result for DSup 
follows. ■ 



Theorem 15. The (non) deterministic Limlnf -automata are closed under min 
with cost 0{n\ -n 2 ), and the nondeterministic LimSup- automata with cost 0{n\ ■ 
n 2 ■ {mi + m 2 )). 

Proof. Let Ax = (Qi, q), S, Si, 71) and A 2 = (Q 2 , qj, S, S 2 , 72) be two NLsup. 
We construct a NLsup A = (Q,qi, S,S, 7) such that La = mii^L^ , La 2 }■ Let 
Vi = {71(e) I e G 5i} be the set of weights that appear in Ai (for i = 1,2). Let 
Vx U V 2 = {vi, ... ,v n } and define 

- Q = {qi}UQi xQ 2 x {1,2} x (V1UV2) (where qi £ Q\ L)Q 2 is a new state). 
Initially, a guess is made of the value v of the input word. Then, we check 
that both A\ and A 2 visit a weight at least v infinitely often. In a state 
(<?i , q 2 , j, v) of A, the guess is stored in v (and will never change along a run) 
and the value of the index j is toggled to 3 — j as soon as Aj does visit a 
weight at least v; 

- For each a G S, the set 8 contains all the triples 

• (qi 7 a 7 {qi 7 q 2 , 1, u)) such that v G Vj. U V 2 and for all i G {1, 2}, we have 

(q},o-,qi) e 

• ((qi,q2,3,v),o-,(q' 11 q 2 ,j',v 1 )) such that u' = w, {qi,o-,q'A G ^ (i = 1,2), 
and j' = 3 — j if Jj(qj, <J, q'j) > w, and j' = j otherwise. 

- 7 is defined by j(qi,(J, (q 1 ,q 2 ,l,v)) = and ^/((qi,q2,j,v),a, (q[, q 2 ,j', v')) 
is v if j ^ j' and w m i n otherwise, where i> m in is the minimal weight in V\ U V 2 . 

For DLinf, the construction is similar to the one presented in the proof of 
Theorem 12 for DLsup, where max is replaced by min. The result for NLlNF 
follows from the fact that NLinf is reducible to DLinf. ■ 



Theorem 16. The deterministic LimSup- automata are closed under min with 
costO(m -n 2 .2 mi+ ™ 2 ). 

Proof. Let A\ = {Qi, qj, E, Si, 71) and A 2 = (Q 2 , q\, S, 5 2 , 72) be two DLsup. 
We construct a DLsup A = (Q,qi, S,S,j) such that La = m\n{LA 17 La 2 }- Let 
Vi = {7i( e ) \ e e Si} be the set of weights that appear in Ai (for i = 1,2). For 
each weight v G V\ U V 2 = {vi, . . . , v n }, we construct a DBW A\ 2 with accepting 
edges. The automaton A v 12 consists of a copy of Ai and a copy of A 2 . We switch 
from one copy to the other whenever an edge with weight at least v is crossed. 
All such switching edges are accepting in A\ 2 . The automaton A then consists 
of the synchronized product of these DBW, where the weight of a joint edge is 
the largest weight v for which the underlying edge in A\ 2 is accepting. Formally, 
let 
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— Q = Qi x Q 2 x {1,2}™ where m = \Vi UV 2 \; 

~ Qi — , 61 , . . . , 6 m ) where &j = 1 for all 1 < i < m; 

— S contains all the triples ((gi, q 2 , 61, . . . , b m ), a, (q[, q' 2 , b' 1} . . . , b' m )) such that 
a G S and 

• (q l ,a,q' l ) G 8 t for i = 1,2; 

• for all 1 < j < to, we have b' rj ■ = 3 — fr, if 7^ (g^. , <r, g h . ) > Vj, and = bj 
otherwise. 

— 7 assigns to each transition ((qi, qi, 61, . . . , 6 m ), cr, (gj , g^, &i, ■ ■ ■ , b' m )) G i5 the 
weight v = max({n m i„} U {vj \ bj ^ bj}) where u m i n is the minimal weight 
in Vi U V 2 . 



On the negative side, the (deterministic or not) limit-average and discounted- 
sum automata are not closed under min. The following lemma establishes the 
result for limit average. 

Lemma 1. Consider the alphabet S = {a, b}, and consider the languages L a 
and Li, that assigns the long-run average number of a 's and b 's, respectively. 
Then the following assertions hold. 

1. There is no NLavg for the language L rn = min{L a , Lb}. 

2. There is no NLavg for the language L* = 1 — max{L a , Lf,}. 

Proof. To obtain a contradiction, assume that there exists a NLavgA (for 
either L m or L*). We first claim that there must be either an a-cycle or a b- 
cycle C that is reachable in A such that the sum of the weights in C is positive. 
Otherwise, if for all a-cycles and ^-cycles we have that the sum of the weights is 
zero or negative, then we fool the automaton as follows. Let (3 be the maximum 
of the absolute values of the weights in A, and let a = . Then consider the 
word w = (a 5 '" '^' • b 5 ' 01 '^) 1 ^ . For a run r of A over w, the long-run average of 
the weights is bounded as follows: 

4-/HQI < 2 

10-a- \Q\ ~ 5' 

The above bound is as follows: in the run over a 5 ' a '^ , there can be a prefix of size 
at most |Q| with sum of weights at most \Q\ -/3, and then there would be a-cycles, 
and then a trailing prefix of size at most |Q| with sum of weights at most \Q\ - p. 
Similar argument holds for the segment of b 5 ' a '^. Hence La(w) < |, however, 
L m (w) = L*(w) = i, i.e., we have a contradiction. W.l.o.g., we assume that 
there is an a-cyclc C such that sum of weights of C is positive. Then we present 
the following word w: a finite word wc to reach the cycle C, followed by a"; the 
answer of the automaton is positive, i.e., La{w) > 0, while L m {w) = L*(w) = 0. 
Hence the result follows. ■ 
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Theorem 17. The (non) deterministic LimAvg- automata are not closed under 
min. 



Proof. The result follows from Lemma 1 and the fact that there exists DLavg 
for the languages L a and Lb of Lemma 1. I 

Finally, we show that discounted-sum automata are not closed under min. 
Theorem 18. The (non) deterministic Disc- automata are not closed under min. 



Proof. Let A be a non-algebraic number in ]i, 1[. We consider the quantitative 
languages L\ and L\ that assign the A-discounted sum of a's and fr's, respectively. 
Formally, given a (finite or infinite) word w = wqWi ■ ■ ■ £ S* U , let 



\w\ \w\ 

r a {w) = X 1 and rb(w) = A 1 

i\wi— a i\wi= b 

be the A-discounted sum of the a's (resp. 6's) of w. Then, L^(w) = r a (w) and 
Lfr(w) = rb(w). These languages arc definable by DDiSC. We show that the 
language L m = min(L^, L A ) is not definable by a NDiSC. 

Assume towards contradiction that there is a NDiSC A for L m . By Lemma 5 
and 6 in [CDH08], there exists an infinite word such that r a (w^) = r^w^). 

Since r Q (w^) + rt(w~ < ) = yrj, we have L m (w^) = 2 (i_a) an< ^ ^his * s * nc 
maximal value of a word in L rn (-). 

The maximal value in the automaton A can be obtained for a lasso-word 
of the form w) 1 .(w 2 )^ (where iui,u> 2 are finite words and w 2 is nonempty), as 
pure memoryless strategies exist in games over finite graphs with the objective 
to maximize the discounted sum of payoffs. Since the language of A is L m , the 
value of u>i.(w2) UJ is 2(1- a) ' anc ^ ^ nus r a{ w i-{ w 2) 0J ) = n>(wi.(u; 2 ) w ) by a similar 
argument as above. This last condition can be written as 

mj A" 1 -g a (A) . . X ni -g b (X) 
Pa( - } + 1 - A"^ = Pb{ ] + 1 - A" 2 

for some polynomials p a ,p b , q a , and integers tt-i > and n 2 > 0, or more 
simply as 

(l-A" 2 )-p(A) + A» 1 .g(A)=0 (1) 

for some polynomials p of degree n\ — 1 and q of degree n 2 — 1, all of whose 
coefficients are either 1 or — 1. Equation (1) is not identically zero as cither (i) 
n\ = and it reduces to q(X) = or (ii) m > and then p has degree at least 
so that the term of degree zero is not null in (1). 

Therefore, A must be algebraic, a contradiction. ■ 
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4.4 Closure under complement for infinite words 

Most of the weighted automata are not closed under complement. The next 
result is a direct extension of the boolean case. 

Theorem 19. The (non) deterministic Sup- and Li m I nf -automata, and the de- 
terministic LimSup- automata are not closed under complement. 

Proof. The result follows from a similar result for the boolean version of these 
classes. For DSup and NSup, consider the language L\ over S — {a, b} such that 
Li(a u ) = and L\[w) = 1 for all w ^ a u . For DLinf and NLinf, consider the 
language L^ over £ = {a, b} such that L2(S* .a u ) = 1 and L(w) = for all words 
w containing infinitely many 5's, and for DLsup, consider L3 the complement 
of La- ■ 

The next theorem is a positive result of closure under complementation for 
NLsup. It reduces to the complementation of nondctcrministic Buchi automata. 

Theorem 20. The nondeterministic LimSup- automata are closed under com- 
plement, with cost 0(m • 2™ lo s™). 

Proof. Let A = {Q, q , S, 8, 7) be a NLsup, and let V = {7(e) e <E 5} be the 
set of weights that appear in A. For each v £ V, it is easy to construct a NBW A v 
whose (boolean) language is the set of words w such that La(w) > v, by declaring 
to be accepting the edges with weight at least v. We then construct for each v EV 
a NBW A v (with accepting edges) that accepts the (boolean) complement of 
the language accepted by A v . Finally, assuming that V — {vi, . . . , v n } with 
vi < V2 < ■■■ < v n , we construct the NLsup Bi for i = 2, . . . ,n where Bi is 
obtained from A Vi by assigning weight —Vi—i to each accepting edges, and — v n 
to all the other edges. The complement of La is then max{is 2 , . . . ,Lb„} which 
is accepted by a NLsup by Theorem 12. ■ 

Theorem 21. The deterministic Disc- automata are closed under complement, 
with cost 0(n). 

Proof Sketch. It suffices to replace each weight v of a DDiSC by 1 — A — v 
(where A is the discount factor) to obtain the DDiSC for the complement. ■ 

Theorem 22. The deterministic \J\m/\\ig-automata are not closed under com- 
plement. 

Proof. Consider the DLavg A over alphabet £ = {a, b} (shown in Fig. 3) that 
consists of a single self-loop state with weight 1 for a and for b. Notice that 
La(w.o?) = 1 and La{w-^) = for all w G S*. To obtain a contradiction, 
assume that there exists a DLavg B whose language is Lb = 1 — La- For all 
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a, 1 




6,0 

Fig. 3. Deterministic Limit-average Automaton. 

finite words w 6 S* , let L^ s (w) be the average weight of the unique (finite) run 
of B over w. 

Fix < e < i. For all finite words w, there exists a number n w such that the 
average number of a's in w.b n ™ is at most e, and there exists a number m w such 
that L* vg ( w.a" lm ) < e (since Ls(w.a w ) = 0). Hence, we can construct a word 
to = o ,ll a mi &" 2 a m2 . . . such that L A (w) < e and L B (w) < e. Since L B = 1 - 
this implies that 1 < 2e, a contradiction. ■ 



Theorem 23. The nondeterministic LimAvg- and Disc- automata are not closed 
under complement. 

Proof. The fact that NLavg are not closed under complementation is as follows: 
it follows from Lemma 1 that the language L* = 1 — max{L a ,Lb} cannot be 
expressed as a NLavg, however, the language max{L a ,L;,} can be expressed 
as NLavg by Theorem 12. That NDiSC are not closed under complement can 
be obtained as follows: given < A < 1, consider the language L\ and h\ 
that assigns to words the A-discountcd sum of a's and 6's, respectively. The 
language L\ and can be expressed as DDiSC, and the max of them can 
be defined by NDiSC. Observe that L*(w) + L$(w) = jzj for all w G S u . 
Therefore, min{L^, } = j-K- — max{L^, L%}. Since NDiSC is not closed under 
min (Theorem 18), we immediately obtain that NDiSC are not closed under 
complementation. ■ 



4.5 Closure under sum for infinite words 

All weighted automata are closed under sum, except DLavg and NLavg. 

Theorem 24. The (non) deterministic Sup-automata are closed under sum, 
with cost 0(ni ■ mi ■ n-i ■ mi). 

Proof Sketch. The construction in the proof of Theorem 14 can be adapted as 
follows: define the weight y(((<?i, V\, V2), c, (q[, v[, q' 2 , v' 2 ))) as v[ + v' 2 for each 
((qi,vi,q2,v 2 ),o-, (g' 1> v' 1 ,g' 2 ,v! l )) e 5. ■ 
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Theorem 25. The nondeterministic LimSup- automata are closed under sum, 
with cost 0(n\ ■ mi ■ n 2 ■ m 2 ). 

Proof Sketch. Given two NLsup A\ and A 2 , we construct a NLsup A for the 
sum of their languages as follows. Initially, we make a guess of a pair (vi,v 2 ) 
of weights (vi in Aj, for i = 1,2) and we branch to a copy of the synchronized 
product of Ai and A 2 . We attach a bit b whose range is {1,2} to each state 
to remember that we expect A^ to visit the guessed weight Vb- Whenever this 
occurs, the bit b is set to 3 — b, and the weight of the transition is v\ + v 2 . 
All other transitions (i.e. when b is unchanged) have weight min{i>i + v 2 \ v\ £ 

v 1 Av 2 ev 2 }. m 



Theorem 26. The deterministic LimSup- automata are closed under sum, with 
costO{ ni -n 2 -2 m ^). 

Proof. Let Ai = (Qi,qj,I!,Si,^i) and A 2 = (Q 2 ,qj, S,8 2 ,^ 2 ) be two DLsup. 
We construct a DLsup A = (Q,qj, S,S,j) such that La — La 1 + La 2 - Let 
Vi = {71(e) I e £ Si} be the set of weights that appear in Ai (for i = 1, 2). The 
automaton A implements the synchronized product of A\ and A 2 , and keeps one 
bit b(vi,v 2 ) for each pair (vi,v 2 ) of weights V\ € V\ and v 2 £ V 2 . For i = 1,2, if 
b(v\, v 2 ) = i, then Aj is expected to cross a transition with weight u,-. Whenever 
this occurs, the bit is set to 3 — i. The weight of a transition in A is the largest 
value of v\ + v 2 such that the corresponding bit b(vi,v 2 ) has changed in the 
transition. Formally, we define: 

- Q = QixQ 2 x[V 1 xV 2 ^{l,2}}; 

- qi = (qj,qj,bi) where &r(t>i,i>2) = 1 for all (vi,v 2 ) £ V\ x V 2 ; 

- For each a £ S, the set S contains all the triples ((qi, q^, b), a, (q[, q' 2 , b')) 
such that (qi,a,qi) £ Si (i = 1,2), and for all (vi,v 2 ) £ V\ X V2, we have 
b'(v\,v 2 ) = 3 — b{v\,v 2 ) if Ji({qi, a, q'^f) = Vi for i — b{vi,v 2 ), and otherwise 
b'(vi,v 2 ) = b(vi,v 2 ). 

- jisde&nedbyj((q 1 ,q 2 ,b),a, (q[,q 2 ,b'}) = max({v min U{v 1 +v 2 \ b'(vi,v 2 ) ^ 
^(^1,^2)}) where v m \ n is the minimal weight in Vy + V 2 = {v\ + v 2 \ vi £ 
V 1 Av 2 £ V 2 }. 



Theorem 27. The (non) deterministic Limlnf -automata are closed under sum 
with cost 0{ni ■ n 2 ■ 2 mi m2 ). 

Proof. Let A\ = {Qi,q}, £, 8\, 71) and A 2 = (Q 2 , qj, S, S 2 , 72) be two DLinf. 
We construct a DLinf A = (Q,qi, S,S,j) such that La — La 1 + La 2 - Let 
Vi = {ji(e) I e £ 5i} be the set of weights that appear in Ai (for i = 1,2). The 
automaton A implements the synchronized product of Ai and A 2l and keeps one 
bit b(vi,v 2 ) for each pair (^1,^2) of weights v\ £ Vi and v 2 £ V 2 . If a transition 
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in Ai for some i £ {1,2} has weight less than «j, then the bit b(v\,v 2 ) is set to 
J_, otherwise is set to T. The weight of a transition in A is the largest value of 
V\ + 1>2 such that the corresponding bit b(i)i,vi) is T. Formally, we define: 

- Q = Qi x Q 2 x [Vl x V 2 {T, _L}]; 

- qi = (qj,Qj,bi) where b I (v 1 ,v 2 ) = ± for all (v 1 ,v 2 ) 6 Vi x V 2 ; 

- For each cr £ the set <5 contains all the triples ({q\,q 2 ,b) ,a, {q' 1: q' 2l b')) 
such that (qi,a,ql) £ Si (i = 1,2), and for all (ui,i>2) £ Vi x V2, we have 
b'(vi,v 2 ) = T if 7i((<?i, cr, <^)) > for z = 1,2, and otherwise V (1)1,1)2) = JL. 

- 7 is defined by 7((<?i,<j2, &),£>", {q'i,q' 2 ,b')) = max({v min U{v 1 +v 2 \ b'{vi,v 2 ) = 
T}) where w m i n is the minimal weight in V\ +V 2 = {y\ + v 2 \ v\ £ V\ /\ v 2 £ 
V 2 }. 

The result for NLinf follows from the fact that NLinf is reducible to DLinf. 



Theorem 28. The (non) deterministic Disc- automata are closed under sum, 
with cost 0(ni ■ n 2 ). 

Proof Sketch. It is easy to see that the synchronized product of two NDiSC 
(resp. DDisc) defines the sum of their languages, if the weight of a joint transi- 
tion is defined as the sum of the weights of the corresponding transitions in the 
two NDiSC (resp. DDisc). ■ 

Theorem 29. The (non) deterministic Lim Avg-automata are not closed under 
sum. 

Proof. Consider the alphabet £ = {a, b}, and consider the DLAVG-dcfinablc 
languages L a and Lb that assigns to each word w the long-run average number 
of a's and 6's in w respectively. Let L+ = L a + Lb- Assume that L+ is defined 
by a NLavg A with set of states Q (we assume w.l.o.g that every state in Q is 
reachable) . 

First, we claim that from every state q £ Q, there is a run of A over a)®' 
that visit a cycle C* with average weight 1. To see this, notice that from every 
state q £ Q, there is an infinite run p of A over a w whose value is 1 (since 
L + (w q ■ a w ) = 1 for all finite words w q ). Consider the following decomposition of 
p. Starting with an empty stack, we push the states of p onto the stack as soon 
as all the states on the stack arc different. If the next state is already on the 
stack, we pop all the states down to the repeated state thus removing a simple 
cycle of p. Let C\, C%, . . . be the cycles that are successively removed. Observe 
that the height of the stack is always at most \Q\. Let (3 be the largest average 
weight of the cycles C%, i > 1, and let a max be the largest weight in A. Assume 
towards contradiction that (3 < 1. Then, for all n > 0, the value of the prefix of 
length n of p is at most: 

«max-|Q|+/3-E^ll^l 

n 
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where k n is the number of cycles that have been removed from the stack when 
reading the first n symbols of p. Hence, the value of p is at most j3 < 1, which is 
a contradiction. Therefore, the average weight of some cycle C* = Ci is exactly 5 
1 (there are finitely many different cycles as they are simple cycles). Since the 
height of the stack is at most \Q\, the cycle C* is reachable in at most \Q\ steps. 

Second, it can be shown analogously that from every state q G Q, there is a 
run over 6'^' that visit a cycle C* with average weight 1. 

Third, for arbitrarily small e > 0, consider the word w and the run p of A over 
w generated inductively by the following procedure: wq is the empty word and 
Po is the initial state of A We generate Wi+i and pi+\ from Wi and pi as follows: 
(i) generate a long enough sequence w' i+1 of a's after iUj such that the average 
number of 6's in Wi ■ w' i+1 falls below e and we can continue pi and reach within 
at most \Q\ steps (and then repeat k times) a cycle C of average weight 1 and 
such that the average weight of this run prolonged by \Q\ arbitrary transitions 
is at least 1 — e, i.e. 

j(pt) + k ■ \C\ + 2a min ■ \Q\ 

\ Pi \ + k-\C\ + 2-\Q\ ~ 6 

where a m - m is the least weight in A. This is possible since k can be chosen 
arbitrarily large. Let p\ be the prolongation of pi over w' i+1 ; (ii) then generate 
a long enough sequence w'L j of 6's such that the average number of a's in 
Wi ■ w' i+1 ■ w'{ +1 falls below e and as above, we can construct a continuation p'( 
of p\ whose average weight is at least 1 — e (even if prolonged by |Q| arbitrary 
transitions); (iii) the word Wi+i = Wi ■ w' i+1 ■ w'{ +1 and the run pi+\ is p" . The 
word w and the run p are the limit of these sequences. We have L a {w) = Lb{w) = 
and thus L + (w) = 0, while the value of p is at least 1 — e, a contradiction. 
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